Trading based on classification and regression trees
نویسنده
چکیده
This thesis investigates whether stock picking based on classification and regression trees can be implemented as a successful algorithmic trading system, if only based on technical analysis. To evaluate the performance of this method a fictional portfolio was constructed from the Stockholm Stock Exchange OMX30, traded on a five-year period. By means of implementation, classification of the assets in the portfolio was initially conducted. By using threshold values of the weekly returns and comparison with the index of the portfolio, every asset was classified as either outperforming, neutral or underperforming. With a satisfactory classification, each asset that is considered as outperforming is held over a period of one week and at the end of the period the position is terminated and a rebalancing of the portfolio is made. If no assets are classified as outperforming, the portfolio is liquidated and invested at a risk-free rate, defined as the STIBOR 1 week rate. When backtesting the model we find that the hit ratio of the overall classification is slightly larger than 50 %. During backtesting over the complete trading period it is found that an immense increase of portfolio value is generated. However, since the model is used in sample no predictive validity outside the range can be made. For this reason, 10-fold cross-validation and resubstitution techniques are employed in order to increase the validity if used in an out-of-sample test. Further, a rolling Sharpe ratio is introduced to evaluate the risk-adjusted returns for both portfolios and it is found that the rebalanced portfolio exhibits greater values. It is concluded that algorithmic trading based on classification and regression trees can be effective in finding patterns that influence the stock prices and that it can form the foundation for an algorithmic trading system. Acknowledgements I would like to thank Karl Hallberg at Nordea for sharing his experience as a quantitative analyst and for his knowledge in algorithmic models. I would also like to thank Professor Boualem Djehiche, my supervisor at the Royal Institute of Technology (KTH), for his efforts and experience of portfolio theory. Further, I would like to thank Gert Engman for providing me with invaluable comments regarding my thesis. Moreover, I would like to thank Johan Obermayer for our discussions regarding mathematical statistics and portfolio theory. Finally, I would like to thank my family for their support as well as my friends at the Royal Institute of Technology for making my time there a great and memorable experience.
منابع مشابه
Factors Influencing Drug Injection History among Prisoners: A Comparison between Classification and Regression Trees and Logistic Regression Analysis
Background: Due to the importance of medical studies, researchers of this field should be familiar with various types of statistical analyses to select the most appropriate method based on the characteristics of their data sets. Classification and regression trees (CARTs) can be as complementary to regression models. We compared the performance of a logistic regression model and a CART in predi...
متن کاملPredicting The Type of Malaria Using Classification and Regression Decision Trees
Predicting The Type of Malaria Using Classification and Regression Decision Trees Maryam Ashoori1 *, Fatemeh Hamzavi2 1School of Technical and Engineering, Higher Educational Complex of Saravan, Saravan, Iran 2School of Agriculture, Higher Educational Complex of Saravan, Saravan, Iran Abstract Background: Malaria is an infectious disease infecting 200 - 300 million people annually. Environme...
متن کاملAssessing Behavioral Patterns of Motorcyclists Based on Traffic Control Device at City Intersections by Classification Tree Algorithm
According to the forensic statistics, in Iran, 26 percent of those killed in traffic accidents are motorcyclists in recent years. Thus, it is necessary to investigate the causes of motorcycle accidents because of the high number of motorcyclist casualties. Motorcyclists' dangerous behaviors are among the causes of events that are discussed in this study. Traffic signs have the important role of...
متن کاملRanking the Trading Symbols of the Largest Companies Listed in the Tehran Stock Exchange Based on the Probability of Informed Trade Criteria
I n this paper, trading symbols of the 30 largest companies listed in the Tehran Stock Exchange (TSE) were ranked based on the asymmetry information risk. Using the Ersan and Alici (2016) modified clustering algorithm (EA), we estimated the probability of informed trading (PIN) to measure the asymmetry information among traders for each trading symbol and trading day through two-year...
متن کاملApplication of classification trees-J48 to model the presence of roach (Rutilus rutilus) in rivers
In the present study, classification trees (CTs-J48 algorithm) were used to study the occurrence of roach in rivers in Flanders (Belgium). The presence/absence of roach was modelled based on a set of river characteristics. The predictive performance of the CTs models was assessed based on the percentage of Correctly Classified Instances (CCI) and Cohen's kappa statistics. To find the best model...
متن کامل